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Abstract — While traditional implementations of variable- 
length digital delay lines are based on a circular buffer ac- 
cessed by two pointers, we propose an implementation where 
a single fractional pointer is used both for read and write 
operations. On modern general-purpose architectures, the 
proposed method is nearly as efficient as the popular inter- 
polated circular buffer, and it behaves well for delay-length 
modulations commonly found in digital audio effects. The 
physical interpretation of the new implementation shows 
that it is suitable for simulating tension or density mod- 
ulations in wave-propagating media. 



I. Introduction 

The digital delay line is a fundamental component of 
many signal processing architectures in several application 
fields. In audio signal processing, delay lines are used to im- 
plement audio effects, such as reverberation or pitch shift- 
ing, or to model wave propagation in physical models of 
musical instruments. 

Since the early days of computer music, the delay line 
has been proposed as a building block in software synthesis 
languages such as Music V In those implementations, 
a fixed delay of an integer number D of samples was imple- 
mented by means of a circular queue having length D. At 
each time sample, a read and then a write operations were 
performed at the location pointed by the single circulating 
pointer^. In order to have fractional lengths, linear inter- 
polation between adjacent memory locations right behind 
the pointer was introduced. 

Since dynamic variations of the delay length are required 
by many important applications, such as pitch shifting, 
variable delay lines were introduced in several signal pro- 
cessing environments and languages. The classic imple- 
mentation of the variable-length digital delay line uses a 
circular buffer, which is accessed by a writing pointer fol- 
lowed by a reading pointer^] ||. When the delay length 
has to be made variable, the relative distance between the 
reading pointer and the writing pointer is varied sample by 
sample. In order to allow for fractional lengths and click- 
free length modulation, some form of interpolation has to 
be applied at the reading point j|, 0, ||. The following 
properties should be ensured by the interpolation device: 

1. flat magnitude frequency response 
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x This is the way the delay unit generator is still implemented in 
the popular sound processing language Csound 111. 

2 This is the way the vdelay unit generator is implemented in 
Csound. 



2. linear phase response 

3. transient-free response to variations of the delay. 
FIR filters, usually in the form of Lagrange interpola- 
tors ||], [||, are widely used. Even though they can not 
satisfy property 1, they are certainly compliant to prop- 
erty 3 and can also satisfy property 2 to a great extent on 
a wide frequency range M. On the other hand, allpass fil- 
ters, often designed to have a maximally-flat delay response 
at low frequencies, satisfy property 1 exactly but are quite 
nonlinear in their phase response 0. Moreover, a rather 
complicated structure has to be devised in order to attain 
property 3 by means of allpass filters 0], @ ■ 

All of the prior realizations, as far as a fixed delay length 
is considered, are linear and time-invariant systems, thus 
being completely described by their frequency response. 
Vice versa, we are proposing a realization which is time- 
varying even in the case of constant delay. This realization 
was first proposed and implemented by the author as part 
of a thesis work Afterwards, improved implementa- 
tions and input-output analyses were sparsely presented 
at some conferences |J, This paper systematizes the 
main ideas and results, and gives some hints to use the 
technique in musical applications. 

In section |l| we recall the classic FIR realization of the 
delay line and show the magnitude and phase responses in 
the case of Lagrange quadratic interpolation. In particular, 
we show how length modulation affects the spectrum of 
an incoming signal. In section |ll| we describe the novel 
Fractionally- Addressed Delay (FAD) line, and in section |y] 
we analyze its input-output behavior in terms of signal-to- 
error ratio and transient response to delay variations. In 
section [v| we interpret the FIR and FAD lines as physical 
wave-propagating media, and we illustrate the application 
of the FAD line in physical modeling of resonators. In 
section VI the FIR and FAD lines are compared in terms of 



performance in general-purpose superscalar architectures, 
and the convenience of using FAD lines in digital audio 
effects is briefly discussed. 

II. Length-Modulated Delay Lines 

Nowadays, a plethora of techniques is available to design 
FIR filters to be used as interpolators in fractional delay 
lines 0] . Such interpolators can be designed in order to be 
optimal in one of several senses. However, most of the op- 
timal FIR interpolators are based on sets of precomputed 
coefficients or use windowing to compute the coefficients 
online. On the other hand, the coefficients of Lagrange 
interpolators are easy to compute and the resulting filter 
has the desirable property of having maximally-flat magni- 
tude in low frequency. Moreover, the magnitude response is 
guaranteed to be less than unity at any frequency, and this 
is a fundamental property if the fractional delays have to 
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be used within feedback structures, such as physical mod- 
els. For these reasons, Lagrange-interpolated delay lines 
are very popular in sound synthesis, digital audio effects, 
and wherever the delay length has to be modulated at run- 
time. In this section, we consider the Lagrange interpolator 
as a reference case, focusing on the second order FIR filter 
used e.g., in [jn]. We introduce an approximate analysis 
of the length- modulated delay that will be useful to under- 
stand the dynamic behavior of the new FAD line. 

A. Lagrange-interpolated delay lines 

A Lagrange interpolator can be characterized at any frac- 
tional delay by its magnitude and phase delay responses Q . 
At any frequency / and for a given interval of fractional 
delays, the interpolator has a magnitude response ranging 
from j4 m i n to A max and a phase delay ranging from r m - nl to 
r max . For example, a second-order (quadratic) interpolator 
can be constructed by using the coefficients M 



magnitude response as a function of D at fs/4 



h = d(l + d)/2 
hx = (l + d)(l-d) 
hi = -d(l-d)/2 



(1) 



in the FIR transfer function H(z) = ho + hiz^ 1 + li2Z~ 2 . If 
we enforce d £ [—1.0 ... 1.0], the magnitude is constrained 
to be less than one, and the phase delay deviates roughly si- 
nusoidally around a straight line going from to 2 samples. 
Figures [j].a and [j]b show the magnitude and excess phase- 
delay responses at / = F s /4 as functions of D = 1 — d. 
Ideally, one would like the excess phase delay to be zero, 
in such a way that the ideal phase delay is equal to D. 

B. Approximate analysis of length-modulated delay lines 

If the parameter d is continuously varied around 0, it is 
clear that frequency-dependent amplitude and phase mod- 
ulations are both applied to the signal. In order to un- 
derstand how these modulations affect the spectrum, we 
assume that the magnitude and phase delay vary sinu- 
soidallyf], and that the frequency of the magnitude sine is 
twice the frequency ojm of the phase-delay sine. This lat- 
ter assumption is justified by the aspect of figure |l|, where 
there are two minima in figure |l}a and one minimum in 
figure [l].b. If the input is a cosine wave at frequency ujq, 
the output signal takes form 



Vd = A m (1 + to cos 2u>Mt) cos (toot ■ 
where 

1 — A ■ 



t w sin u M t) , 



A, 



1- 
1 



A m 

- A r 



(2) 
(3) 
(4) 



and A m i n and r max are the minimum magnitude and max- 
imum delay for the given interval of variability of d and 

3 This is quite a strong assumption, but the goal of this analysis is 
just to outline a qualitative behavior of the spectrum. The actual 
behavior will be slightly different, especially because of deviations 
from zero-mean and sinusoidal variation of d. 
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Fig. 1 

Magnitude (a) and excess phase-delay (b) responses for a 
quadratic lagrange interpolator as functions of the 

PARAMETER D = 1 — d AT FREQUENCY / = F s /4 



frequency ujq. If we enforce d£ [—0.5. ..0.5], the extremal 
magnitude and phase delay responses as functions of fre- 
quency are plotted in figures ^.a and |^.b, respectively. 

The amplitude modulation gives rise to a carrier and two 
side bands: 



V d = Am COS (uJ t + T max W sin U M t) 

'" ' lm cos ((w - 2uj M )t + T max £j sinwjif*) 



(5) 



2 

mA r , 



cos ((ujq + 2u) M )t - 



x uj sinu> M t) 



The phase modulation generates infinitely many sidebands 
of the three components of amplitude modulation. How- 
ever, if we assume that the sidebands of order higher than 
two are negligible 4 , the resulting signal can be expressed 
as 

Vd = Aocosujot (6) 
— Ai cos (ujq — tdM )t + A\ cos (wo + 0Jhi)t 

4 This assumption is justified by experimental observations. 
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Fig. 2 

Extremal magnitude (a) and phase delay (b) responses for a 
quadratic lagrange interpolator 



+A 2 cos (u - 2ujM)t + A 2 cos (u + 2o>m)< 
+ ... , 



where the coefficients Ai are given by table | and J{ is 
the i-th order Bessel function of the first kind evaluated in 



A 


A! 


A 2 


A m J + mA m J 2 


A T mAm 7 


, mA m 

An ^2 + £ 



TABLE I 

Amplitude of the sidebands of an amplitude- and 
phase-modulated signal 



Figure |3| shows the magnitude of the carrier and two 
side modulation products as a function of frequency, where 
D = 1 (d = 0) and the sinusoidal length modulator has am- 
plitude 0.5 (i.e. it modulates the central part of figure p. 
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Fig. 3 

Carrier and side modulation products as a function of 
carrier frequency 



C. Delay-length dithering 

The general trend expressed by figure || is that modula- 
tion acts as a lowpass filter on the carrier and as a kind 
of highpass filter for the side components. An interesting 
point is made by moving the center of modulation slightly 
away from and observing that we get output signal com- 
ponents that are very similar to those of figure |. On the 
other hand, a fixed-length interpolated delay would exhibit 
different magnitude responses at different values of d, in 
a range bounded by the curves of figure |^.a. As a con- 
sequence, we can use delay-length modulation in order to 
have a more uniform frequency response for different values 
of d, as it is often required in applications such as physi- 
cal modeling (see section |v|). In other words, small-range 
modulation can be used as a sort of dithering in the delay 
length, and possibly put on top of long-range modulations. 
As an example, figure ^ shows the sonogram of the re- 
sponse of a non-modulated and modulated FIR line to a 
pulse train. In both cases, the delay length is slowly linearly 
increased in order to explore different fractional values of 
d. In figure ||.a the valleys corresponding to the minimum- 
magnitude curve of figure ^.a are visible as darker areas. In 
figure ^.b the darker areas are more uniformly spread along 
the horizontal axis, thus allowing magnitude compensation 
by fixed high-pass filtering. 

A different issue is the audibility of the side components 
introduced by modulation. This depends on the strength 
of the side components and on their position relative to the 
carrier signal. If these components lay below the threshold 
of masking for the carrier signal G3| they are not audible. 
If the modulation frequency is around 60 — 80Hz the side 
components turn out to be below that threshold for partials 
laying in the first 2 — 3kHz. 

III. A Fractionally-Addressed Delay Line 

An alternative realization of the delay line can be devel- 
oped by observing that a single pointer is sufficient for both 
the read and write accesses. If the delay line has fixed in- 
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Fig. 4 

Sonogram of the response of a non-modulated (a) and 

MODULATED (b) FIR DELAY LINE WITH LINEARLY-INCREASING 
LENGTH TO A PULSE TRAIN. HANNING WINDOWS OF 256 SAMPLES 
ARE USED IN ANALYSIS. MAGNITUDE (iN DB) IS SMALLER WHERE THE 
POINTS ARE DARKER. 



teger length B, it is possible to use a buffer exactly B-cells 
long and a single pointer whose entry is hrst read and then 
written. At every sample the phase pointer is incremented 
to point to the following cell. In the same buffer we can also 
implement any delay which is an integer fraction B/I just 
by incrementing the pointer at steps of / samples. We are 
going to show how this scheme can be generalized to non- 
integer fractions of the total buffer length. The resulting 
technique can be seen as an extension of the table-lookup 
oscillator |l3| ], Ejj ], with the fundamental difference that 
every read is followed by one or more writes, in such a way 
that the waveform is continuously re-stored while being 
read. 

Given a buffer size of B samples, and a sample rate F s , a 
(fractional) increment of / samples gives a delay in seconds 



delay = Q/P * lenbuf ; "/delay in samples 
framelen = floor (delay) ; 
for n=l :nf rames 

bufout = resample (buf f er , P, Q) ' ; 7 read 

output = [[output, bufout]]; 

f write (fid_out , bufout, 'intl6'); 

buf in = f read(f id_in, framelen, 'intl6'); 

buffer = resample (buf in, Q, P) ; "/write 
end 

Fig. 5 

MATLAB CODE FOR A FRAME-BASED REALIZATION OF THE FAD LINE 



equal to 



T = 



B 



I 



(7) 



Since this realization is related to waveform generation 
by fractional addressing fllf] , we call it the Fractionally- 
Addressed Delay (FAD) line. 

A. Frame- Based Realization 

The most immediate way to code the FAD line is by 
means of frame-based processing, e.g., using the resample 
function of the MATLAB Signal Processing Toolbox^. The 
code chunk of figure || is intended to be inserted into the 
framework of a frame-based digital audio effect, as pre- 
scribed by the COST-G6 action on Digital Audio Effects 
(DAFX)^ It uses a buffer of length lenbuf to imple- 
ment a delay having fractional length Q/P * lenbuf (i.e. 
/ = lenbuf /delay = P/Q). 

At every frame, the buffer is first read with a deci- 
mating factor equal to P/Q and written to the output file 
f id_out. Then framelen values of the input stream, com- 
ing from file f id_in, are interpolated with factor P/Q and 
written into the buffer. The resample operation, which 
uses a polyphase implementation of the interpolation fil- 
ters 0, pages 677-679], takes care of changing the effective 
length of the buffer to a fractional number, in such a way 
that the output stream is the delay-ed version of the in- 
put stream. According to equation (Q) , using an increment 
ranging from 2 to 1 we can implement any fractional de- 
lay ranging from B/2 to B samples, where B is the buffer 
length. For the purpose of this paper we limit the vari- 
ability of the delay length to half its nominal size. Noth- 
ing prevents to reduce the length even further, but this 
is not computationally convenient in the implementation 
presented in section HI-B| . 

The implementation of figure || shows the close relation- 
ship between sample-rate conversion EJ]] , [§) , || and delay 
interpolation. This connection can be useful to understand 
the fractional delay thoroughly. For instance, we usually 

5 MATT,AR jg a registered trademark of The MathWorks, Inc., 
http:/ /www. mathworks.com 
" Bee the COST-Ofi DAFX home page 



http :/ /echo . gaps . ssr.upm.es/COSTG6/ 
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adopt 7 > 1 because otherwise we would write a down- 
sampled (and therefore bandlimited) version of the signal 
into the buffer, thus making it impossible to recover the 
high-frequency components in the following read. 

In the implementation of figure continuous variations 
of the effective delay length can not be imposed within 
a single frame, since the interpolation factor is varied 
frame by frame. Continuous delay modulations are essen- 
tial for constructing digital audio effects such as choruses 
and hangers so that a frame-by-frame MATLAB im- 
plementation of these effects should use alternative inter- 
polating functions where the interpolation factor can be 
prescribed in a local per-sample basis (see e.g. the MAT- 
LAB function interpl). However, there are many applica- 
tions of fractional delay lines within signal processing flow- 
graphs exhibiting feedback-connected modules. In general, 
these modules can not be computed on a frame-by-framc 
basis due to circular dependencies spanning time delays 
different from the frame length. Therefore, we are mainly 
interested in devising a sample-by-sample implementation 
of the FAD line and to study its behavior in terms of signal 
accuracy and computational complexity. 

B. Sample-by- Sample Realization 

As we have mentioned, the FAD line can be interpreted 
as an extension of the table-lookup oscillator, where the 
waveform is written while it is being read. The read op- 
eration can be treated in exactly the same way as in the 
table-lookup oscillator, being possible to apply truncation, 
polynomial interpolation, or multirate interpolation tech- 
niques j|, ||. More complicated is the injection of a new 
value, to be done right after the read, in such a way that no 
"holes" are left in the current pass through the buffer. A 
fractional increment would correspond to a variable num- 
ber of writes at each step. For instance, for 7 = 1.5, three 
writes have to be performed for every couple of reads. In- 
terpolation in write has been proposed by Valimaki et al. 
in the context of digital waveguide modeling |is|| , and 
there called deinterpolation. A deinterpolator can be 
obtained from a FIR interpolator by transposing its struc- 
ture and inverting the sequence of coefficients j^, pages 
128-134]. These operations lead to a structure which is a 
sequence of cumulative additions into a delay line. Equiva- 
lently, interpolation can be performed on the input stream 
by using some extra unit delays, thus saving read and write 
operations in the buffer. This latter implementation is de- 
picted in figure [6] for the case of second-order read and 
write interpolators, with an increment 7 = 1.25. The dash- 
dotted lines in the figure should indicate how resampling is 
performed on the fly on both the read and the write side 
of the access to the buffer. If further writes have to be 
performed after the first one in order to All the blanks, the 
quantity d of figure || has to be updated to d + 1 and the 
corresponding coefficients of the writing interpolator have 
to be recomputed. 

The FAD line can be expressed by the pseudo-code of 
figure ^. Actual MATLAB and C functions implementing 
the FAD line can be found in the software repository of the 




output 



Fig. 6 

Interpolated read and write access to a circular buffer 



loop 

fph = floor (phase) ; 

output = interpolated_read(table [f ph] , 

table [fph+1] , ...); 
ph = (phase_old + 1) MOD length_table ; 
while (ph <= fph) { 

table [ph] = interpolated_write ( 
. . . , table [phase_old] , input) ; 
ph = (ph + 1) MOD length_table; 

} 

phase_old = fph; 

phase = (phase + Increment) ; 

if (phase > length_table) 

phase = phase - length_table ; 

endloop 

Fig. 7 

Pseudo-code for a sample-by-sample realization of the FAD 

LINE 



COST-G6 action on Digital Audio Effectsj]. 

Notice that the interpolatedjread uses samples follow- 
ing the phase pointer, while the interpolated_write uses 
samples preceding the pointer. 

In the following sections we are going to analyze the per- 
formance of a FAD line using quadratic interpolation in 
both read and write operations. 

IV. Input-Output Analysis 

7 The software repository can be reached from the home page of the 

riOST-Ofi P)AFX action: 

http : / / echo .gaps. ssr.upm.es/COSTG6/ 
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A. Experimental results 

The FAD line is a time-varying system, and therefore 
it is difficult to characterize in terms of frequency re- 
sponse. Figure ^| shows the magnitude spectrum of a 
delayed 5 — kHz sine wave when quadratic interpolation is 
used in reading and writing. When a sinusoidal input feeds 
the FAD line, spurious components are added to the main 
spectral line. The magnitude of these components might 
be dependent on the frequency of the input sine wave and 
the initial (fractional) phase of the FAD-line pointer []. The 
signal-to-error noise ratio (SNR) as a function of these two 
parameters shows a very mild dependence on initial phase. 
Therefore, it makes sense to plot the average SNR as a 
function of the input frequency only (figure ^|). We can 
see that low frequencies arc affected by high SNR, thus 
indicating that the FAD line has an acceptable behavior 
for practical sounds. The noise error has been computed 
as the sum of the squared differences between the input 
and output waveform samples^ |15}| . In figure ^ we have 
considered input sine waves having periods that perfectly 
divide the delay length, and a time delay that is two thirds 
of the delay length (increment is 1.5). By applying such 
SNR analysis to a low-order FIR filter we would obtain a 
curve lower than that of figure || |l^] . We do not report 
that curve here because it can be misleading. In fact, while 
the error of the FAD line can be interpreted as noise, the 
error for a FIR line is only given by phase displacement and 
magnitude attenuation, and no spurious signal components 
are introduced. 

Especially for applications such as waveguide modeling 
of musical instruments p9[ , it is important to consider the 
attenuation that different frequencies are subject to when 
fed into the delay line. For instance, the decay time of a 
frequency partial in a waveguide string depends on the at- 
tenuation of the interpolated delay line at that frequency. 
The attenuation of the main peak of the output spectrum 
turns out to be dependent on the initial phase. In order 
to have a rough idea of the attenuation property of the 
quadratically-interpolated FAD line, we plot in figure [l(] 
the minimum, maximum, and mean attenuation as a func- 
tion of frequency of the input sine wave, for the same set of 
frequencies as in figure ^, and for a delay such that / = 1.5. 
Again, direct comparison with the FIR line is problematic 
because the results would be highly dependent on the cho- 
sen delay length. 

B. Signal-to- error noise analysis 

In section || we have seen how delay-length modulations 
add side components to the peaks of a frequency spec- 
trum, as a result of modulations in magnitude and phase 
responses of the Lagrange interpolator. In the implemen- 
ts an example of dependence on initial phase, consider the in- 
crement 1 = 1. If the initial phase is the pointer always falls on 
samples. If the initial phase is 0.5 the pointer always falls between 
samples. In the two cases, the actual shape of the delayed output is 
different. 

9 A normalizing factor y^2/N has been applied, being N the number 
of samples per period. 
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Magnitude spectrum of the output signal of a FAD line, 
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Fig. 9 

EXPERIMENTAL SIGNAL-TO-ERROR NOISE RATIO VS. SINE FREQUENCY 
FOR. THE FAD LINE WITH QUADRATIC INTERPOLATION. THE INPUT 
FREQUENCIES ARE SUCH THAT THE INTERPOLATION PHASE IS A 
MULTIPLE OF 2w. 



tation of the FAD line, the magnitude and phase delay 
of the interpolators are varied sample by sample, as the 
value of d (see figure ^) is changed at every sample. If we 
exclude the degenerate cases (such as that obtained with 
increment 7=1) we can assume that, in consecutive ac- 
cesses to the buffer, the magnitude and the phase delay 
vary sinusoidally around their mean. Therefore, we can 
use the approximate analysis of section [H], where an input 
sinewave at frequency too is subject to phase modulation 
with a certain modulating frequency lum and to amplitude 
modulation with frequency 2ujm- 

The modulating frequency lom is related to the fractional 
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Fig. 10 

Attenuation of a sinusoidal input Vs. sine frequency for the 
quadratically-interpolated fad line. sampling rate is set 
equal to the buffer length and delay is set to d = 2/3s 



part // of the increment / by 

lom = 2nIfF s 



(8) 



If If ~ 0.3, the modulation products are folded into 
the high-frequency region for practical sounds, i.e. for 
sounds whose energy is mainly concentrated in a band- 
width smaller than F s /8. This means that these modula- 
tion products are less audible (especially when the sample 
rate is larger than 44kHz), and it is easier to eliminate 
them by lowpass filtering. 

As opposed to the case analyzed in section [n[ in the 
FAD line there are two modulations, one acting on input 
and the other acting on output. However, we can still use 
the values of table | to compute an estimate of the SNR of 
the FAD line, in the pessimistic case when the two modula- 
tions produced by the read and write interpolators operate 
constructively. An approximate value of this SNR is 



SNR 



(9) 



where the carrier appears at the numerator and the side- 
bands sum up at the denominator. Figure [ll] depicts this 
signal-to-error noise ratio as a function of the input fre- 
quency. The curve gives a tight lower bound to the values 
of SNR measured in simulations and reported in figure ^, 
thus justifying the assumptions taken in order to simplify 
the analysis. 

C. Behavior for time-varying delay 

The FAD line shows an unconventional behavior when 
the delay length is dynamically varied. This is illustrated 
in the following analysis through comparison with the FIR 
implementation. 

Starting from the steady state of a delay line fed with 
a stationary signal, suppose to vary the delay length as a 
linear function of time t. Namely, we start at time with 




0.4 0.6 0.8 1 1.2 
radian frequency [radian/sample] 

Fig. 11 

SlGNAL-TO-ERROR NOISE RATIO FOR THE QUADRATIC FAD LINE 



the nominal delay To and decrease it at the rate of k seconds 
per second: 

T{t) =ro-kt. (10) 

The FIR implementation responds with an instantaneous 
pitch shift in the output signal. In other words, we get a 
Doppler effect and the pitch shift is 



A/ = 1 + k . 



(11) 



On the other hand, the FAD line provides a steady pitch 
shift 



after a transient time 



A/ 



(12) 



(13) 



A similar transient is observed when the delay ramp is 
stopped. 

The transient time ( |l3| ) can be calculated by feeding the 
delay line with an impulse at time 0. It will come out of 
the line at time instant r, such that 



I(t)dt = 



B 



(14) 



where I(t) is the time-dependent increment which produces 
the desired ramp in delay length. Equation (|l4|) can be 
rewritten, using (fjj) and jlC|), as 



1 



o T o 



kt 



-dt 



(15) 



which is solved by ()l3|). 

The steady-state transposition (|l2j) can be calculated by 
observing that a second impulse entering the line at time 
Ti "sees" an instantaneous delay of r — k% seconds. It 
gets out of the line at time T °~ fc fcTi (1 — e~ k ) + Ti, exactly 
Tie~ k seconds after the impulse which entered at time 0. 

It is interesting to notice that the dynamic behavior 
of the FAD line is similar to the behavior of the analog 
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CCD delay line J2(J, which was built in MOS technology 
as a sampling system where packets of charge were shifted 
through the channel of a multigate MOSFET transistor^ 
In that case, if B was the number of gates and F s was the 
control frequency of a multiphase clock, the delay was still 
given by (^), provided that I is set to one. In order to vary 
the delay of the CCD line, F s had to be made time varying, 
and it is easy to see that ([[2]) and (|l3| ) held for that line 
as well. 

If the delay ramp is applied for 1.11 seconds starting at 
time with an empty line, the response of the FAD line to 
a steady sinusoidal input is displayed as a sonogram in fig- 
ure [l^. The transient is clearly visible in the output when 
the ramp is stopped. This sonogram should be compared 
to the one obtained with the FIR line (figure |l3|) . 
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Fig. 12 

FAD LINE: DELAY RAMP FROM 0.99 S TO 0.5 S IN 1.11 S; 1 S OF 
SINUSOIDAL INPUT 
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ulations of the quadratic FAD and FIR lines are reported 
in figure [TJ and [l^, respectively. The figures show that the 
FAD line is less sensitive to artifacts clearly visible (and 
audible) as faint waves figure [y| which are essentially due 
to modulations induced by the nonideal response of the 
interpolator. 
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Fig. 14 

FAD LINE WITH QUADRATIC INTERPOLATION: DELAY-LENGTH 
VIBRATO 
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Fig. 15 

FIR LINE WITH QUADRATIC INTERPOLATION: DELAY-LENGTH VIBRATO 
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Fig. 13 

FIR LINE: DELAY RAMP FROM 0.99 S TO 0.5 S IN 1.11 S; 1 S OF 
SINUSOIDAL INPUT 



A different behavior is also reported in response to sinu- 
soidal modulations of the delay length. These modulations 
are essential for effects such as flanging or phasing. Mod- 

10 Thanks to Giovanni De Poli for pointing out this similarity. 



V. Physical Interpretation 

If the dynamic behavior of the delay lines is closely an- 
alyzed, we see that the FAD and the FIR realizations ac- 
tually simulate two different physical phenomena. In both 
cases, the lines can be thought of as a one-dimensional 
medium where waves propagate. However, when the de- 
lay length is dynamically reduced we have two different 
physical analogies in the two cases. The shortening of the 
FIR line corresponds to the receiver getting closer to the 
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transmitter, and therefore we have a tight simulation of the 
Doppler effect. On the other hand, the shortening of the 
FAD line corresponds to increasing the speed of propaga- 
tion in the medium while maintaining the same physical 
distance between the two ends. 

Figures [l6|-p"8| illustrate what happens when using differ- 
ent implementations of the delay lines in the simulation of 
a one-dimensional waveguide resonator, such as a string. 
In this application, sketched in fig. [l^, there is a couple 
of delay lines in a feedback connection, each representing 
propagation of waves in one direction. For the sake of sim- 
plicity, the terminations are supposed to be perfectly re- 
flecting. If one of the delays is fed by three periods of a 
fast sine wave, this packet propagates, gets reflected, feeds 
the other line, and comes back to the excitation point for 
another reflection. Under ideal conditions, the packet keeps 
going back and forth without attenuation or losses. Sup- 
pose that, right after a reflection at the left end, suddenly 
the string gets lengthened by some amount, as we would 
do for lowering the pitch of the string. This can be sim- 
ulated, in the classic FIR line implementation, by moving 
the reading pointer backwards. However, this operation ex- 
poses again the wave packet which has just been reflected, 
thus modifying the "duty cycle" of the waveform, as re- 
ported on figure [l7|. A correct waveform can be obtained 
by adding a write operation right after the read to the FIR 
line implementation. This write takes care of erasing the 
waveform as it passes the reading pointer. Another way of 
lowering the pitch is that of changing the string tension in- 
stantaneously. In a waveguide simulation this corresponds 
to changing the spatial sampling via a change in the waveg- 
uide speed of propagation |nj . This can be achieved by the 
FAD line implementation just by changing the phase incre- 
ment, and the result is illustrated in figure Ell A mixture 
of tension and length increase might be obtained by using 
the FAD line with a variable buffer size. 

A different approach to dynamic tension variations in 
string models has been recently proposed by Valimaki et 
al. pi! , They recast the resampling process induced 

by tension modulation into delay length modulation of a 
FIR line controlled by the buffer content. This reformu- 
lation works under the assumption of a single observation 
point. The control circuitry is composed of a power esti- 
mator (which needs to sum the squares of the content of 
several delay cells) and a numerical integrator. 
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Fig. 16 

Waveguide model of an ideal string. A sudden pitch 

LOWERING IS DEPICTED FOR THE FIR IMPLEMENTATION OF DELAY 
LINES (DASHED LINE) AND FOR THE FAD IMPLEMENTATION OF DELAY 
LINES (DASH-DOTTED LINE) . THE STRING BEFORE THE PITCH 
TRANSITION IS REPRESENTED BY THE DOTTED LINE. 



The two ways of producing a pitch shift are not equiv- 
alent as far as timbre is concerned, as it can be deduced 
from figures [It] and [T^. In fact, the FAD-line pitch change 
produces a contraction of the whole spectrum, thus mod- 
ifying the position of formants, while the FIR-line pitch 
change comes without moving the formants. Moreover, 
when no waveform erasing is applied out of the reading 
pointer in the FIR line, the width of the main formant gets 
narrowed. On the other hand erasing after read preserves 
both the formant widths and positions. 

Summarizing, different implementations of the delay line 
do have practical consequences on the timbre produced by 
dynamically- varying waveguide models. 

VI. Computational Performance and 
Application to Digital Audio Effects 

The FAD line has only one pointer for accessing data 
in the buffer. It exhibits spatial locality because any short 
sequence of accesses spans over a small neighborhood of the 
pointed buffer cell. On the other hand, a FIR line has two 
pointers, thus exhibiting two distinct spatial localities. As 
a consequence, we expect that the FAD line makes better 
use of the cache in general purpose computer architectures. 
However, the FAD performs more writes than reads. In 
order to attain a 50% of delay variability, we have to accept 
up to two writes for each read. This overhead is partially 
compensated by the highest efficiency of write operations 
in modern architectures | p2| . 

These two observations justify the fact that the FAD 
line, despite of its higher complexity, does not run much 
slower than the FIR line on a general purpose computer. 
A benchmark for quadratically interpolated FIR and FAD 
lines has been performed on an AMD — K6 architecture 
by repeatedly delaying a soundfile stored in an array. The 
experiment was done using a Linux operating system with 
the machine in stand-alone single-user configuration. To 
avoid the effects of context switching due to the underlying 
operating system, we took the fastest of 14 repetitions, 
thus obtaining the results summarized in figure [l9L where 
running times are reported for varying buffer size. 

A first comment is about the difference in performance 
between the two algorithms. This is not as big as one might 
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Fig. 17 

Note transition waveform: FIR line without erase after 

READ 
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Fig. 18 

Note transition waveform: FAD line 



expect, especially if we consider that the quadratically- 
interpolated FAD line has about three times as many mul- 
tiplies, twice as many adds, three extra divides, and three 
times as many tests as the FIR line. A second comment is 
about the fact that the curves tend to be monotonically in- 
creasing. This indicates that more and more cache misses 
are encountered when using larger buffers. However, the 
FIR line shows an increase in capacity and conflict misses 
right after the size of 65536 samples which, when translated 
in bytes (we use 8-byte double floating point numbers), 
gives the size of the level-2 cacheP]. 

Using the FIR line and eliminating the phase increment 
it is possible to measure the overall cost of caching, wich 
is visible from figure [n] as the difference between the two 
lower curves. Since it turns out to be less than 6% even 
for very large buffers, we can argue that delay lines are not 
affected much by the memory hierarchy. Similar values of 
the percentage cost of caching have been measured in other 
architectures (e.g., the Intel Pentium II), even though the 
actual shape of the curves is slightly different. 

In our implementations, we have not done aggressive 
code optimization, so that the relative performance of the 
three realizations might vary in practice from what we have 
shown. In particular, float-to-integer conversions are ex- 
pensive, and it would be wise to perform them by direct bit 
manipulation, as suggested in |p3[ . Moreover, the memory 
access patterns show that hardware or software prefetching 
techniques might be used effectively, especially in the FAD 
line that has only one locality. 

High-quality digital audio effects, such as choruses or 
flangers, can be easily constructed around delay-modulated 
FAD lines. The audio quality is good due to go od noi se re- 
jection for time- varying delay, as shown in sec. IV-C. The 



fact that the FAD line does not respond instantaneously 
to variations of the delay length turns out to be useful 
for achieving natural sounding effects by means of random 
variations of the delay length. For instance, "random walk" 
pitch modulations can easily be achieved by low-rate ran- 
dom variations of the delay length. Namely, it is sufficient 
to call a random function, say, every 100 samples, and to 

11 And also of the memory covered by the Translation Lookaside 
Buffer, which is responsible for fast translation of virtual addresses 
to physical addresses. 



0.6 r — - 
0.55 - 
0.5 



FAD quad 
FIR quad 
FIR no inc 



_ 0.45 

in, 

<D 

E 

= 0.4 
0.35 
0.3 
0.25 



10000 



100000 



buffersize [samples] 



Fig. 19 

Performance on an AMD-K6 of the 

QUADRATICALLY-INTERPOLATED FAD LINE, FIR LINE, AND FIR LINE 
WITH NO PHASE INCREMENT, AS A FUNCTION OF BUFFER SIZE. 



perform piecewise linear interpolation between its values. 
The resulting output spectrum exhibits partials which are 
floating around without sudden changes. In the FIR im- 
plementation, a similar effect would require either a com- 
plicated modulating function to be read at audio rate or 
high-order interpolation performed on the undersampled 
control signal [ pi[ . 

VII. Conclusion 

We have proposed a realization of the digital delay line 
which is based on an extension of the table-lookup oscilla- 
tor. The proposed realization exploits the features of mod- 
ern computer architectures and shows good performance 
in terms of signal-to-error noise ratio, frequency-dependent 
attenuation and dynamic behavior. We expect this delay 
line will be considered as a building block for physically- 
based sound synthesis and for sound effects such as flangers 
and choruses. 
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